Overview

Brought to you by YData

Dataset statistics

Number of variables27
Number of observations899164
Missing cells751259
Missing cells (%)3.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory185.2 MiB
Average record size in memory216.0 B

Variable types

Numeric8
Text9
DateTime3
Unsupported1
Categorical6

Alerts

RevLineCr is highly imbalanced (61.3%) Imbalance
LowDoc is highly imbalanced (80.6%) Imbalance
BalanceGross is highly imbalanced (> 99.9%) Imbalance
ChgOffDate has 736465 (81.9%) missing values Missing
NoEmp is highly skewed (γ1 = 80.24824355) Skewed
CreateJob is highly skewed (γ1 = 36.99135473) Skewed
RetainedJob is highly skewed (γ1 = 36.85481184) Skewed
LoanNr_ChkDgt has unique values Unique
ApprovalFY is an unsupported type, check if it needs cleaning or further analysis Unsupported
NAICS has 201948 (22.5%) zeros Zeros
CreateJob has 629248 (70.0%) zeros Zeros
RetainedJob has 440403 (49.0%) zeros Zeros
FranchiseCode has 208835 (23.2%) zeros Zeros

Reproduction

Analysis started2025-02-09 08:22:02.379166
Analysis finished2025-02-09 08:23:28.343758
Duration1 minute and 25.96 seconds
Software versionydata-profiling vv4.12.2
Download configurationconfig.json

Variables

LoanNr_ChkDgt
Real number (ℝ)

Unique 

Distinct899164
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.7726123 × 109
Minimum1.000014 × 109
Maximum9.996003 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:28.482920image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1.000014 × 109
5-th percentile1.3484572 × 109
Q12.5897575 × 109
median4.361439 × 109
Q36.9046265 × 109
95-th percentile9.1648039 × 109
Maximum9.996003 × 109
Range8.995989 × 109
Interquartile range (IQR)4.314869 × 109

Descriptive statistics

Standard deviation2.538175 × 109
Coefficient of variation (CV)0.53182091
Kurtosis-1.086499
Mean4.7726123 × 109
Median Absolute Deviation (MAD)2.0134 × 109
Skewness0.3647571
Sum4.2913612 × 1015
Variance6.4423325 × 1018
MonotonicityStrictly increasing
2025-02-09T09:23:28.573713image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9996003010 1
 
< 0.1%
1000014003 1
 
< 0.1%
1000024006 1
 
< 0.1%
1000034009 1
 
< 0.1%
1000044001 1
 
< 0.1%
1000054004 1
 
< 0.1%
1000084002 1
 
< 0.1%
1000093009 1
 
< 0.1%
1000094005 1
 
< 0.1%
1000104006 1
 
< 0.1%
Other values (899154) 899154
> 99.9%
ValueCountFrequency (%)
1000014003 1
< 0.1%
1000024006 1
< 0.1%
1000034009 1
< 0.1%
1000044001 1
< 0.1%
1000054004 1
< 0.1%
1000084002 1
< 0.1%
1000093009 1
< 0.1%
1000094005 1
< 0.1%
1000104006 1
< 0.1%
1000124001 1
< 0.1%
ValueCountFrequency (%)
9996003010 1
< 0.1%
9995973006 1
< 0.1%
9995613003 1
< 0.1%
9995603000 1
< 0.1%
9995573004 1
< 0.1%
9995563001 1
< 0.1%
9995493004 1
< 0.1%
9995473009 1
< 0.1%
9995453003 1
< 0.1%
9995423005 1
< 0.1%

Name
Text

Distinct779583
Distinct (%)86.7%
Missing14
Missing (%)< 0.1%
Memory size6.9 MiB
2025-02-09T09:23:28.994152image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length30
Median length23
Mean length21.775963
Min length1

Characters and Unicode

Total characters19579857
Distinct characters91
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique706468 ?
Unique (%)78.6%

Sample

1st rowABC HOBBYCRAFT
2nd rowLANDMARK BAR & GRILLE (THE)
3rd rowWHITLOCK DDS, TODD M.
4th rowBIG BUCKS PAWN & JEWELRY, LLC
5th rowANASTASIA CONFECTIONS, INC.
ValueCountFrequency (%)
inc 263379
 
8.4%
100280
 
3.2%
llc 77826
 
2.5%
and 28959
 
0.9%
the 28389
 
0.9%
of 23026
 
0.7%
dba 20214
 
0.6%
co 18216
 
0.6%
a 18114
 
0.6%
services 17318
 
0.6%
Other values (226643) 2530176
80.9%
2025-02-09T09:23:29.428424image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2231639
 
11.4%
E 1354056
 
6.9%
I 1226719
 
6.3%
A 1177821
 
6.0%
N 1170319
 
6.0%
R 1052562
 
5.4%
C 1038114
 
5.3%
S 1009495
 
5.2%
O 933206
 
4.8%
T 917437
 
4.7%
Other values (81) 7468489
38.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19579857
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2231639
 
11.4%
E 1354056
 
6.9%
I 1226719
 
6.3%
A 1177821
 
6.0%
N 1170319
 
6.0%
R 1052562
 
5.4%
C 1038114
 
5.3%
S 1009495
 
5.2%
O 933206
 
4.8%
T 917437
 
4.7%
Other values (81) 7468489
38.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19579857
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2231639
 
11.4%
E 1354056
 
6.9%
I 1226719
 
6.3%
A 1177821
 
6.0%
N 1170319
 
6.0%
R 1052562
 
5.4%
C 1038114
 
5.3%
S 1009495
 
5.2%
O 933206
 
4.8%
T 917437
 
4.7%
Other values (81) 7468489
38.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19579857
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2231639
 
11.4%
E 1354056
 
6.9%
I 1226719
 
6.3%
A 1177821
 
6.0%
N 1170319
 
6.0%
R 1052562
 
5.4%
C 1038114
 
5.3%
S 1009495
 
5.2%
O 933206
 
4.8%
T 917437
 
4.7%
Other values (81) 7468489
38.1%

City
Text

Distinct32581
Distinct (%)3.6%
Missing30
Missing (%)< 0.1%
Memory size6.9 MiB
2025-02-09T09:23:29.626576image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length30
Median length27
Mean length9.1030625
Min length1

Characters and Unicode

Total characters8184873
Distinct characters80
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12872 ?
Unique (%)1.4%

Sample

1st rowEVANSVILLE
2nd rowNEW PARIS
3rd rowBLOOMINGTON
4th rowBROKEN ARROW
5th rowORLANDO
ValueCountFrequency (%)
city 23831
 
2.0%
san 21942
 
1.8%
new 16075
 
1.3%
los 13000
 
1.1%
angeles 12380
 
1.0%
lake 10729
 
0.9%
houston 10587
 
0.9%
beach 10462
 
0.9%
park 10316
 
0.9%
york 9724
 
0.8%
Other values (17695) 1066583
88.5%
2025-02-09T09:23:29.918186image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 744405
 
9.1%
E 723098
 
8.8%
O 632510
 
7.7%
N 621338
 
7.6%
L 573578
 
7.0%
R 513614
 
6.3%
S 475392
 
5.8%
I 468344
 
5.7%
T 425108
 
5.2%
306936
 
3.8%
Other values (70) 2700550
33.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 8184873
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
A 744405
 
9.1%
E 723098
 
8.8%
O 632510
 
7.7%
N 621338
 
7.6%
L 573578
 
7.0%
R 513614
 
6.3%
S 475392
 
5.8%
I 468344
 
5.7%
T 425108
 
5.2%
306936
 
3.8%
Other values (70) 2700550
33.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 8184873
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
A 744405
 
9.1%
E 723098
 
8.8%
O 632510
 
7.7%
N 621338
 
7.6%
L 573578
 
7.0%
R 513614
 
6.3%
S 475392
 
5.8%
I 468344
 
5.7%
T 425108
 
5.2%
306936
 
3.8%
Other values (70) 2700550
33.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 8184873
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
A 744405
 
9.1%
E 723098
 
8.8%
O 632510
 
7.7%
N 621338
 
7.6%
L 573578
 
7.0%
R 513614
 
6.3%
S 475392
 
5.8%
I 468344
 
5.7%
T 425108
 
5.2%
306936
 
3.8%
Other values (70) 2700550
33.0%

State
Text

Distinct51
Distinct (%)< 0.1%
Missing14
Missing (%)< 0.1%
Memory size6.9 MiB
2025-02-09T09:23:30.029424image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters1798300
Distinct characters24
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowIN
2nd rowIN
3rd rowIN
4th rowOK
5th rowFL
ValueCountFrequency (%)
ca 130619
 
14.5%
tx 70458
 
7.8%
ny 57693
 
6.4%
fl 41212
 
4.6%
pa 35170
 
3.9%
oh 32622
 
3.6%
il 29669
 
3.3%
ma 25272
 
2.8%
mn 24373
 
2.7%
nj 24035
 
2.7%
Other values (41) 428027
47.6%
2025-02-09T09:23:30.191441image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 306176
17.0%
C 184957
10.3%
N 181727
10.1%
M 132549
 
7.4%
T 125069
 
7.0%
I 119518
 
6.6%
O 94906
 
5.3%
L 88819
 
4.9%
X 70458
 
3.9%
Y 68255
 
3.8%
Other values (14) 425866
23.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1798300
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
A 306176
17.0%
C 184957
10.3%
N 181727
10.1%
M 132549
 
7.4%
T 125069
 
7.0%
I 119518
 
6.6%
O 94906
 
5.3%
L 88819
 
4.9%
X 70458
 
3.9%
Y 68255
 
3.8%
Other values (14) 425866
23.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1798300
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
A 306176
17.0%
C 184957
10.3%
N 181727
10.1%
M 132549
 
7.4%
T 125069
 
7.0%
I 119518
 
6.6%
O 94906
 
5.3%
L 88819
 
4.9%
X 70458
 
3.9%
Y 68255
 
3.8%
Other values (14) 425866
23.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1798300
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
A 306176
17.0%
C 184957
10.3%
N 181727
10.1%
M 132549
 
7.4%
T 125069
 
7.0%
I 119518
 
6.6%
O 94906
 
5.3%
L 88819
 
4.9%
X 70458
 
3.9%
Y 68255
 
3.8%
Other values (14) 425866
23.7%

Zip
Real number (ℝ)

Distinct33611
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53804.391
Minimum0
Maximum99999
Zeros283
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:30.270057image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3838
Q127587
median55410
Q383704
95-th percentile95822
Maximum99999
Range99999
Interquartile range (IQR)56117

Descriptive statistics

Standard deviation31184.159
Coefficient of variation (CV)0.5795839
Kurtosis-1.3359893
Mean53804.391
Median Absolute Deviation (MAD)28206
Skewness-0.16816663
Sum4.8378972 × 1010
Variance9.7245178 × 108
MonotonicityNot monotonic
2025-02-09T09:23:30.360073image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10001 933
 
0.1%
90015 926
 
0.1%
93401 806
 
0.1%
90010 733
 
0.1%
33166 671
 
0.1%
90021 666
 
0.1%
59601 640
 
0.1%
65804 599
 
0.1%
3801 581
 
0.1%
59101 578
 
0.1%
Other values (33601) 892031
99.2%
ValueCountFrequency (%)
0 283
< 0.1%
1 24
 
< 0.1%
2 11
 
< 0.1%
3 5
 
< 0.1%
4 5
 
< 0.1%
5 5
 
< 0.1%
6 4
 
< 0.1%
7 6
 
< 0.1%
8 15
 
< 0.1%
9 24
 
< 0.1%
ValueCountFrequency (%)
99999 209
< 0.1%
99950 3
 
< 0.1%
99929 15
 
< 0.1%
99928 1
 
< 0.1%
99926 1
 
< 0.1%
99925 4
 
< 0.1%
99923 1
 
< 0.1%
99921 13
 
< 0.1%
99919 2
 
< 0.1%
99918 1
 
< 0.1%

Bank
Text

Distinct5802
Distinct (%)0.6%
Missing1559
Missing (%)0.2%
Memory size6.9 MiB
2025-02-09T09:23:30.538281image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length30
Median length26
Mean length23.187946
Min length3

Characters and Unicode

Total characters20813616
Distinct characters50
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique923 ?
Unique (%)0.1%

Sample

1st rowFIFTH THIRD BANK
2nd row1ST SOURCE BANK
3rd rowGRANT COUNTY STATE BANK
4th row1ST NATL BK & TR CO OF BROKEN
5th rowFLORIDA BUS. DEVEL CORP
ValueCountFrequency (%)
bank 651608
18.5%
natl 318240
 
9.0%
assoc 306768
 
8.7%
of 142852
 
4.1%
national 125899
 
3.6%
america 100686
 
2.9%
association 84965
 
2.4%
fargo 63732
 
1.8%
wells 63650
 
1.8%
52264
 
1.5%
Other values (3602) 1606709
45.7%
2025-02-09T09:23:30.804310image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 2762231
13.3%
2620014
12.6%
N 2105500
10.1%
S 1520499
 
7.3%
O 1336993
 
6.4%
T 1181841
 
5.7%
C 1134642
 
5.5%
I 1061717
 
5.1%
E 923739
 
4.4%
L 922583
 
4.4%
Other values (40) 5243857
25.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 20813616
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
A 2762231
13.3%
2620014
12.6%
N 2105500
10.1%
S 1520499
 
7.3%
O 1336993
 
6.4%
T 1181841
 
5.7%
C 1134642
 
5.5%
I 1061717
 
5.1%
E 923739
 
4.4%
L 922583
 
4.4%
Other values (40) 5243857
25.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 20813616
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
A 2762231
13.3%
2620014
12.6%
N 2105500
10.1%
S 1520499
 
7.3%
O 1336993
 
6.4%
T 1181841
 
5.7%
C 1134642
 
5.5%
I 1061717
 
5.1%
E 923739
 
4.4%
L 922583
 
4.4%
Other values (40) 5243857
25.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 20813616
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
A 2762231
13.3%
2620014
12.6%
N 2105500
10.1%
S 1520499
 
7.3%
O 1336993
 
6.4%
T 1181841
 
5.7%
C 1134642
 
5.5%
I 1061717
 
5.1%
E 923739
 
4.4%
L 922583
 
4.4%
Other values (40) 5243857
25.2%
Distinct56
Distinct (%)< 0.1%
Missing1566
Missing (%)0.2%
Memory size6.9 MiB
2025-02-09T09:23:30.911183image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters1795196
Distinct characters24
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st rowOH
2nd rowIN
3rd rowIN
4th rowOK
5th rowFL
ValueCountFrequency (%)
ca 118116
 
13.2%
nc 79514
 
8.9%
il 65908
 
7.3%
oh 58461
 
6.5%
sd 51095
 
5.7%
tx 47790
 
5.3%
ri 45366
 
5.1%
ny 39592
 
4.4%
va 29002
 
3.2%
de 24537
 
2.7%
Other values (46) 338217
37.7%
2025-02-09T09:23:31.082060image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 241398
13.4%
C 229604
12.8%
N 187751
10.5%
I 158854
 
8.8%
O 102604
 
5.7%
L 96914
 
5.4%
D 96078
 
5.4%
T 94941
 
5.3%
M 85034
 
4.7%
S 73385
 
4.1%
Other values (14) 428633
23.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1795196
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
A 241398
13.4%
C 229604
12.8%
N 187751
10.5%
I 158854
 
8.8%
O 102604
 
5.7%
L 96914
 
5.4%
D 96078
 
5.4%
T 94941
 
5.3%
M 85034
 
4.7%
S 73385
 
4.1%
Other values (14) 428633
23.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1795196
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
A 241398
13.4%
C 229604
12.8%
N 187751
10.5%
I 158854
 
8.8%
O 102604
 
5.7%
L 96914
 
5.4%
D 96078
 
5.4%
T 94941
 
5.3%
M 85034
 
4.7%
S 73385
 
4.1%
Other values (14) 428633
23.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1795196
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
A 241398
13.4%
C 229604
12.8%
N 187751
10.5%
I 158854
 
8.8%
O 102604
 
5.7%
L 96914
 
5.4%
D 96078
 
5.4%
T 94941
 
5.3%
M 85034
 
4.7%
S 73385
 
4.1%
Other values (14) 428633
23.9%

NAICS
Real number (ℝ)

Zeros 

Distinct1312
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean398660.95
Minimum0
Maximum928120
Zeros201948
Zeros (%)22.5%
Negative0
Negative (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:31.158755image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1235210
median445310
Q3561730
95-th percentile811192
Maximum928120
Range928120
Interquartile range (IQR)326520

Descriptive statistics

Standard deviation263318.31
Coefficient of variation (CV)0.66050691
Kurtosis-1.0476526
Mean398660.95
Median Absolute Deviation (MAD)176300
Skewness-0.26287834
Sum3.5846157 × 1011
Variance6.9336534 × 1010
MonotonicityNot monotonic
2025-02-09T09:23:31.245431image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 201948
 
22.5%
722110 27989
 
3.1%
722211 19448
 
2.2%
811111 14585
 
1.6%
621210 14048
 
1.6%
624410 10111
 
1.1%
812112 9230
 
1.0%
561730 8935
 
1.0%
621310 8733
 
1.0%
812320 7894
 
0.9%
Other values (1302) 576243
64.1%
ValueCountFrequency (%)
0 201948
22.5%
111110 32
 
< 0.1%
111120 3
 
< 0.1%
111130 1
 
< 0.1%
111140 94
 
< 0.1%
111150 49
 
< 0.1%
111160 2
 
< 0.1%
111191 3
 
< 0.1%
111199 7
 
< 0.1%
111211 16
 
< 0.1%
ValueCountFrequency (%)
928120 32
< 0.1%
928110 4
 
< 0.1%
927110 1
 
< 0.1%
926150 10
 
< 0.1%
926140 6
 
< 0.1%
926130 3
 
< 0.1%
926120 5
 
< 0.1%
926110 6
 
< 0.1%
925120 1
 
< 0.1%
925110 3
 
< 0.1%
Distinct9859
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size6.9 MiB
Minimum1975-01-20 00:00:00
Maximum2074-12-17 00:00:00
Invalid dates0
Invalid dates (%)0.0%
2025-02-09T09:23:31.330274image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:31.417794image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

ApprovalFY
Unsupported

Rejected  Unsupported 

Missing0
Missing (%)0.0%
Memory size6.9 MiB

Term
Real number (ℝ)

Distinct412
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean110.77308
Minimum0
Maximum569
Zeros810
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:31.501085image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile16
Q160
median84
Q3120
95-th percentile300
Maximum569
Range569
Interquartile range (IQR)60

Descriptive statistics

Standard deviation78.857305
Coefficient of variation (CV)0.7118815
Kurtosis0.18570424
Mean110.77308
Median Absolute Deviation (MAD)33
Skewness1.1209258
Sum99603164
Variance6218.4746
MonotonicityNot monotonic
2025-02-09T09:23:31.586965image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
84 230162
25.6%
60 89945
 
10.0%
240 85982
 
9.6%
120 77654
 
8.6%
300 44727
 
5.0%
180 28164
 
3.1%
36 19800
 
2.2%
12 17095
 
1.9%
48 15621
 
1.7%
72 9419
 
1.0%
Other values (402) 280595
31.2%
ValueCountFrequency (%)
0 810
 
0.1%
1 1608
0.2%
2 1809
0.2%
3 2112
0.2%
4 2173
0.2%
5 1866
0.2%
6 3054
0.3%
7 1761
0.2%
8 1693
0.2%
9 1875
0.2%
ValueCountFrequency (%)
569 1
< 0.1%
527 1
< 0.1%
511 1
< 0.1%
505 1
< 0.1%
481 1
< 0.1%
480 1
< 0.1%
461 1
< 0.1%
449 1
< 0.1%
445 1
< 0.1%
443 1
< 0.1%

NoEmp
Real number (ℝ)

Skewed 

Distinct599
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.411353
Minimum0
Maximum9999
Zeros6631
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:31.674056image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q310
95-th percentile40
Maximum9999
Range9999
Interquartile range (IQR)8

Descriptive statistics

Standard deviation74.108196
Coefficient of variation (CV)6.4942514
Kurtosis7965.2886
Mean11.411353
Median Absolute Deviation (MAD)3
Skewness80.248244
Sum10260678
Variance5492.0248
MonotonicityNot monotonic
2025-02-09T09:23:31.848605image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 154254
17.2%
2 138297
15.4%
3 90674
10.1%
4 73644
 
8.2%
5 60319
 
6.7%
6 45759
 
5.1%
10 31536
 
3.5%
7 31495
 
3.5%
8 31361
 
3.5%
12 20822
 
2.3%
Other values (589) 221003
24.6%
ValueCountFrequency (%)
0 6631
 
0.7%
1 154254
17.2%
2 138297
15.4%
3 90674
10.1%
4 73644
8.2%
5 60319
 
6.7%
6 45759
 
5.1%
7 31495
 
3.5%
8 31361
 
3.5%
9 18131
 
2.0%
ValueCountFrequency (%)
9999 4
< 0.1%
9992 1
 
< 0.1%
9945 1
 
< 0.1%
9090 1
 
< 0.1%
9000 2
 
< 0.1%
8500 1
 
< 0.1%
8041 1
 
< 0.1%
8018 1
 
< 0.1%
8000 7
< 0.1%
7999 1
 
< 0.1%

NewExist
Categorical

Distinct3
Distinct (%)< 0.1%
Missing136
Missing (%)< 0.1%
Memory size6.9 MiB
1.0
644869 
2.0
253125 
0.0
 
1034

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters2697084
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.0
2nd row2.0
3rd row1.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.0 644869
71.7%
2.0 253125
 
28.2%
0.0 1034
 
0.1%
(Missing) 136
 
< 0.1%

Length

2025-02-09T09:23:31.921497image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-02-09T09:23:32.021538image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
1.0 644869
71.7%
2.0 253125
 
28.2%
0.0 1034
 
0.1%

Most occurring characters

ValueCountFrequency (%)
0 900062
33.4%
. 899028
33.3%
1 644869
23.9%
2 253125
 
9.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2697084
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 900062
33.4%
. 899028
33.3%
1 644869
23.9%
2 253125
 
9.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2697084
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 900062
33.4%
. 899028
33.3%
1 644869
23.9%
2 253125
 
9.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2697084
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 900062
33.4%
. 899028
33.3%
1 644869
23.9%
2 253125
 
9.4%

CreateJob
Real number (ℝ)

Skewed  Zeros 

Distinct246
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.4303764
Minimum0
Maximum8800
Zeros629248
Zeros (%)70.0%
Negative0
Negative (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:32.097857image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile10
Maximum8800
Range8800
Interquartile range (IQR)1

Descriptive statistics

Standard deviation236.68817
Coefficient of variation (CV)28.075634
Kurtosis1369.911
Mean8.4303764
Median Absolute Deviation (MAD)0
Skewness36.991355
Sum7580291
Variance56021.288
MonotonicityNot monotonic
2025-02-09T09:23:32.188128image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 629248
70.0%
1 63174
 
7.0%
2 57831
 
6.4%
3 28806
 
3.2%
4 20511
 
2.3%
5 18691
 
2.1%
10 11602
 
1.3%
6 11009
 
1.2%
8 7378
 
0.8%
7 6374
 
0.7%
Other values (236) 44540
 
5.0%
ValueCountFrequency (%)
0 629248
70.0%
1 63174
 
7.0%
2 57831
 
6.4%
3 28806
 
3.2%
4 20511
 
2.3%
5 18691
 
2.1%
6 11009
 
1.2%
7 6374
 
0.7%
8 7378
 
0.8%
9 3330
 
0.4%
ValueCountFrequency (%)
8800 648
0.1%
5621 1
 
< 0.1%
5199 1
 
< 0.1%
5085 1
 
< 0.1%
3500 1
 
< 0.1%
3100 1
 
< 0.1%
3000 4
 
< 0.1%
2515 1
 
< 0.1%
2140 1
 
< 0.1%
2020 1
 
< 0.1%

RetainedJob
Real number (ℝ)

Skewed  Zeros 

Distinct358
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.797257
Minimum0
Maximum9500
Zeros440403
Zeros (%)49.0%
Negative0
Negative (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:32.289071image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q34
95-th percentile20
Maximum9500
Range9500
Interquartile range (IQR)4

Descriptive statistics

Standard deviation237.1206
Coefficient of variation (CV)21.961188
Kurtosis1362.0182
Mean10.797257
Median Absolute Deviation (MAD)1
Skewness36.854812
Sum9708505
Variance56226.179
MonotonicityNot monotonic
2025-02-09T09:23:32.383721image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 440403
49.0%
1 88790
 
9.9%
2 76851
 
8.5%
3 49963
 
5.6%
4 39666
 
4.4%
5 32627
 
3.6%
6 23796
 
2.6%
7 16530
 
1.8%
8 15698
 
1.7%
10 15438
 
1.7%
Other values (348) 99402
 
11.1%
ValueCountFrequency (%)
0 440403
49.0%
1 88790
 
9.9%
2 76851
 
8.5%
3 49963
 
5.6%
4 39666
 
4.4%
5 32627
 
3.6%
6 23796
 
2.6%
7 16530
 
1.8%
8 15698
 
1.7%
9 8735
 
1.0%
ValueCountFrequency (%)
9500 1
 
< 0.1%
8800 648
0.1%
7250 1
 
< 0.1%
5000 1
 
< 0.1%
4441 1
 
< 0.1%
4000 2
 
< 0.1%
3900 1
 
< 0.1%
3860 1
 
< 0.1%
3225 1
 
< 0.1%
3200 1
 
< 0.1%

FranchiseCode
Real number (ℝ)

Zeros 

Distinct2768
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2753.7259
Minimum0
Maximum99999
Zeros208835
Zeros (%)23.2%
Negative0
Negative (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:32.474873image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median1
Q31
95-th percentile15805
Maximum99999
Range99999
Interquartile range (IQR)0

Descriptive statistics

Standard deviation12758.019
Coefficient of variation (CV)4.6330025
Kurtosis24.409524
Mean2753.7259
Median Absolute Deviation (MAD)0
Skewness4.9752152
Sum2.4760512 × 109
Variance1.6276705 × 108
MonotonicityNot monotonic
2025-02-09T09:23:32.567573image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 638554
71.0%
0 208835
 
23.2%
78760 3373
 
0.4%
68020 1921
 
0.2%
50564 1034
 
0.1%
21780 1003
 
0.1%
25650 715
 
0.1%
79140 659
 
0.1%
22470 615
 
0.1%
17998 606
 
0.1%
Other values (2758) 41849
 
4.7%
ValueCountFrequency (%)
0 208835
 
23.2%
1 638554
71.0%
3 12
 
< 0.1%
395 5
 
< 0.1%
399 3
 
< 0.1%
400 2
 
< 0.1%
401 12
 
< 0.1%
404 1
 
< 0.1%
407 34
 
< 0.1%
414 2
 
< 0.1%
ValueCountFrequency (%)
99999 1
 
< 0.1%
92006 4
 
< 0.1%
92000 9
< 0.1%
91999 11
< 0.1%
91450 2
 
< 0.1%
91446 1
 
< 0.1%
91443 2
 
< 0.1%
91435 1
 
< 0.1%
91424 1
 
< 0.1%
91423 2
 
< 0.1%

UrbanRural
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size6.9 MiB
1
470654 
0
323167 
2
105343 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters899164
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
1 470654
52.3%
0 323167
35.9%
2 105343
 
11.7%

Length

2025-02-09T09:23:32.644634image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-02-09T09:23:32.690139image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
1 470654
52.3%
0 323167
35.9%
2 105343
 
11.7%

Most occurring characters

ValueCountFrequency (%)
1 470654
52.3%
0 323167
35.9%
2 105343
 
11.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 899164
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1 470654
52.3%
0 323167
35.9%
2 105343
 
11.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 899164
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1 470654
52.3%
0 323167
35.9%
2 105343
 
11.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 899164
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1 470654
52.3%
0 323167
35.9%
2 105343
 
11.7%

RevLineCr
Categorical

Imbalance 

Distinct18
Distinct (%)< 0.1%
Missing4528
Missing (%)0.5%
Memory size6.9 MiB
N
420288 
0
257602 
Y
201397 
T
 
15284
1
 
23
Other values (13)
 
42

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters894636
Distinct characters18
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)< 0.1%

Sample

1st rowN
2nd rowN
3rd rowN
4th rowN
5th rowN

Common Values

ValueCountFrequency (%)
N 420288
46.7%
0 257602
28.6%
Y 201397
22.4%
T 15284
 
1.7%
1 23
 
< 0.1%
R 14
 
< 0.1%
` 11
 
< 0.1%
2 6
 
< 0.1%
C 2
 
< 0.1%
, 1
 
< 0.1%
Other values (8) 8
 
< 0.1%
(Missing) 4528
 
0.5%

Length

2025-02-09T09:23:32.747333image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
n 420288
47.0%
0 257602
28.8%
y 201397
22.5%
t 15284
 
1.7%
1 23
 
< 0.1%
r 14
 
< 0.1%
14
 
< 0.1%
2 6
 
< 0.1%
c 2
 
< 0.1%
3 1
 
< 0.1%
Other values (5) 5
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N 420288
47.0%
0 257602
28.8%
Y 201397
22.5%
T 15284
 
1.7%
1 23
 
< 0.1%
R 14
 
< 0.1%
` 11
 
< 0.1%
2 6
 
< 0.1%
C 2
 
< 0.1%
, 1
 
< 0.1%
Other values (8) 8
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 894636
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N 420288
47.0%
0 257602
28.8%
Y 201397
22.5%
T 15284
 
1.7%
1 23
 
< 0.1%
R 14
 
< 0.1%
` 11
 
< 0.1%
2 6
 
< 0.1%
C 2
 
< 0.1%
, 1
 
< 0.1%
Other values (8) 8
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 894636
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N 420288
47.0%
0 257602
28.8%
Y 201397
22.5%
T 15284
 
1.7%
1 23
 
< 0.1%
R 14
 
< 0.1%
` 11
 
< 0.1%
2 6
 
< 0.1%
C 2
 
< 0.1%
, 1
 
< 0.1%
Other values (8) 8
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 894636
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N 420288
47.0%
0 257602
28.8%
Y 201397
22.5%
T 15284
 
1.7%
1 23
 
< 0.1%
R 14
 
< 0.1%
` 11
 
< 0.1%
2 6
 
< 0.1%
C 2
 
< 0.1%
, 1
 
< 0.1%
Other values (8) 8
 
< 0.1%

LowDoc
Categorical

Imbalance 

Distinct8
Distinct (%)< 0.1%
Missing2582
Missing (%)0.3%
Memory size6.9 MiB
N
782822 
Y
110335 
0
 
1491
C
 
758
S
 
603
Other values (3)
 
573

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters896582
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowY
2nd rowY
3rd rowN
4th rowY
5th rowN

Common Values

ValueCountFrequency (%)
N 782822
87.1%
Y 110335
 
12.3%
0 1491
 
0.2%
C 758
 
0.1%
S 603
 
0.1%
A 497
 
0.1%
R 75
 
< 0.1%
1 1
 
< 0.1%
(Missing) 2582
 
0.3%

Length

2025-02-09T09:23:32.809380image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-02-09T09:23:32.865133image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
n 782822
87.3%
y 110335
 
12.3%
0 1491
 
0.2%
c 758
 
0.1%
s 603
 
0.1%
a 497
 
0.1%
r 75
 
< 0.1%
1 1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N 782822
87.3%
Y 110335
 
12.3%
0 1491
 
0.2%
C 758
 
0.1%
S 603
 
0.1%
A 497
 
0.1%
R 75
 
< 0.1%
1 1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 896582
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N 782822
87.3%
Y 110335
 
12.3%
0 1491
 
0.2%
C 758
 
0.1%
S 603
 
0.1%
A 497
 
0.1%
R 75
 
< 0.1%
1 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 896582
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N 782822
87.3%
Y 110335
 
12.3%
0 1491
 
0.2%
C 758
 
0.1%
S 603
 
0.1%
A 497
 
0.1%
R 75
 
< 0.1%
1 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 896582
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N 782822
87.3%
Y 110335
 
12.3%
0 1491
 
0.2%
C 758
 
0.1%
S 603
 
0.1%
A 497
 
0.1%
R 75
 
< 0.1%
1 1
 
< 0.1%

ChgOffDate
Date

Missing 

Distinct6448
Distinct (%)4.0%
Missing736465
Missing (%)81.9%
Memory size6.9 MiB
Minimum1988-10-03 00:00:00
Maximum2026-10-22 00:00:00
Invalid dates0
Invalid dates (%)0.0%
2025-02-09T09:23:32.957101image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:33.055715image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct8472
Distinct (%)0.9%
Missing2368
Missing (%)0.3%
Memory size6.9 MiB
Minimum1975-01-17 00:00:00
Maximum2074-12-04 00:00:00
Invalid dates0
Invalid dates (%)0.0%
2025-02-09T09:23:33.148623image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:33.239389image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct118859
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:33.505652image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length15
Median length14
Mean length11.537586
Min length6

Characters and Unicode

Total characters10374182
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79785 ?
Unique (%)8.9%

Sample

1st row$60,000.00
2nd row$40,000.00
3rd row$287,000.00
4th row$35,000.00
5th row$229,000.00
ValueCountFrequency (%)
50,000.00 43787
 
4.9%
100,000.00 36714
 
4.1%
25,000.00 27387
 
3.0%
150,000.00 23373
 
2.6%
10,000.00 21328
 
2.4%
35,000.00 14748
 
1.6%
5,000.00 14193
 
1.6%
75,000.00 13528
 
1.5%
20,000.00 13462
 
1.5%
30,000.00 12696
 
1.4%
Other values (118849) 677948
75.4%
2025-02-09T09:23:33.838739image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4457089
43.0%
, 924978
 
8.9%
. 899164
 
8.7%
$ 899164
 
8.7%
899164
 
8.7%
5 445569
 
4.3%
1 409947
 
4.0%
2 312909
 
3.0%
3 238773
 
2.3%
4 207077
 
2.0%
Other values (4) 680348
 
6.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 10374182
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 4457089
43.0%
, 924978
 
8.9%
. 899164
 
8.7%
$ 899164
 
8.7%
899164
 
8.7%
5 445569
 
4.3%
1 409947
 
4.0%
2 312909
 
3.0%
3 238773
 
2.3%
4 207077
 
2.0%
Other values (4) 680348
 
6.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 10374182
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 4457089
43.0%
, 924978
 
8.9%
. 899164
 
8.7%
$ 899164
 
8.7%
899164
 
8.7%
5 445569
 
4.3%
1 409947
 
4.0%
2 312909
 
3.0%
3 238773
 
2.3%
4 207077
 
2.0%
Other values (4) 680348
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 10374182
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 4457089
43.0%
, 924978
 
8.9%
. 899164
 
8.7%
$ 899164
 
8.7%
899164
 
8.7%
5 445569
 
4.3%
1 409947
 
4.0%
2 312909
 
3.0%
3 238773
 
2.3%
4 207077
 
2.0%
Other values (4) 680348
 
6.6%

BalanceGross
Categorical

Imbalance 

Distinct15
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size6.9 MiB
$0.00
899150 
$12,750.00
 
1
$827,875.00
 
1
$25,000.00
 
1
$37,100.00
 
1
Other values (10)
 
10

Length

Max length12
Median length6
Mean length6.0000767
Min length6

Characters and Unicode

Total characters5395053
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)< 0.1%

Sample

1st row$0.00
2nd row$0.00
3rd row$0.00
4th row$0.00
5th row$0.00

Common Values

ValueCountFrequency (%)
$0.00 899150
> 99.9%
$12,750.00 1
 
< 0.1%
$827,875.00 1
 
< 0.1%
$25,000.00 1
 
< 0.1%
$37,100.00 1
 
< 0.1%
$43,127.00 1
 
< 0.1%
$84,617.00 1
 
< 0.1%
$1,760.00 1
 
< 0.1%
$115,820.00 1
 
< 0.1%
$996,262.00 1
 
< 0.1%
Other values (5) 5
 
< 0.1%

Length

2025-02-09T09:23:33.918731image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
0.00 899150
> 99.9%
12,750.00 1
 
< 0.1%
827,875.00 1
 
< 0.1%
25,000.00 1
 
< 0.1%
37,100.00 1
 
< 0.1%
43,127.00 1
 
< 0.1%
84,617.00 1
 
< 0.1%
1,760.00 1
 
< 0.1%
115,820.00 1
 
< 0.1%
996,262.00 1
 
< 0.1%
Other values (5) 5
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0 2697490
50.0%
$ 899164
 
16.7%
. 899164
 
16.7%
899164
 
16.7%
, 13
 
< 0.1%
1 11
 
< 0.1%
7 8
 
< 0.1%
2 7
 
< 0.1%
6 7
 
< 0.1%
9 7
 
< 0.1%
Other values (4) 18
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 5395053
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 2697490
50.0%
$ 899164
 
16.7%
. 899164
 
16.7%
899164
 
16.7%
, 13
 
< 0.1%
1 11
 
< 0.1%
7 8
 
< 0.1%
2 7
 
< 0.1%
6 7
 
< 0.1%
9 7
 
< 0.1%
Other values (4) 18
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 5395053
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 2697490
50.0%
$ 899164
 
16.7%
. 899164
 
16.7%
899164
 
16.7%
, 13
 
< 0.1%
1 11
 
< 0.1%
7 8
 
< 0.1%
2 7
 
< 0.1%
6 7
 
< 0.1%
9 7
 
< 0.1%
Other values (4) 18
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 5395053
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 2697490
50.0%
$ 899164
 
16.7%
. 899164
 
16.7%
899164
 
16.7%
, 13
 
< 0.1%
1 11
 
< 0.1%
7 8
 
< 0.1%
2 7
 
< 0.1%
6 7
 
< 0.1%
9 7
 
< 0.1%
Other values (4) 18
 
< 0.1%

MIS_Status
Categorical

Distinct2
Distinct (%)< 0.1%
Missing1997
Missing (%)0.2%
Memory size6.9 MiB
P I F
739609 
CHGOFF
157558 

Length

Max length6
Median length5
Mean length5.1756172
Min length5

Characters and Unicode

Total characters4643393
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP I F
2nd rowP I F
3rd rowP I F
4th rowP I F
5th rowP I F

Common Values

ValueCountFrequency (%)
P I F 739609
82.3%
CHGOFF 157558
 
17.5%
(Missing) 1997
 
0.2%

Length

2025-02-09T09:23:33.993652image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-02-09T09:23:34.042511image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
p 739609
31.1%
i 739609
31.1%
f 739609
31.1%
chgoff 157558
 
6.6%

Most occurring characters

ValueCountFrequency (%)
1479218
31.9%
F 1054725
22.7%
P 739609
15.9%
I 739609
15.9%
C 157558
 
3.4%
H 157558
 
3.4%
G 157558
 
3.4%
O 157558
 
3.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4643393
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1479218
31.9%
F 1054725
22.7%
P 739609
15.9%
I 739609
15.9%
C 157558
 
3.4%
H 157558
 
3.4%
G 157558
 
3.4%
O 157558
 
3.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4643393
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1479218
31.9%
F 1054725
22.7%
P 739609
15.9%
I 739609
15.9%
C 157558
 
3.4%
H 157558
 
3.4%
G 157558
 
3.4%
O 157558
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4643393
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1479218
31.9%
F 1054725
22.7%
P 739609
15.9%
I 739609
15.9%
C 157558
 
3.4%
H 157558
 
3.4%
G 157558
 
3.4%
O 157558
 
3.4%
Distinct83165
Distinct (%)9.2%
Missing0
Missing (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:34.224628image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length14
Median length6
Mean length6.8997235
Min length6

Characters and Unicode

Total characters6203983
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52342 ?
Unique (%)5.8%

Sample

1st row$0.00
2nd row$0.00
3rd row$0.00
4th row$0.00
5th row$0.00
ValueCountFrequency (%)
0.00 737152
82.0%
50,000.00 2110
 
0.2%
10,000.00 1865
 
0.2%
25,000.00 1371
 
0.2%
35,000.00 1345
 
0.1%
100,000.00 1028
 
0.1%
20,000.00 594
 
0.1%
30,000.00 492
 
0.1%
15,000.00 467
 
0.1%
5,000.00 356
 
< 0.1%
Other values (83155) 152384
 
16.9%
2025-02-09T09:23:34.505421image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2643222
42.6%
$ 899164
 
14.5%
. 899164
 
14.5%
899164
 
14.5%
, 161591
 
2.6%
1 98607
 
1.6%
2 88727
 
1.4%
4 86077
 
1.4%
9 81470
 
1.3%
3 79226
 
1.3%
Other values (4) 267571
 
4.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 6203983
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 2643222
42.6%
$ 899164
 
14.5%
. 899164
 
14.5%
899164
 
14.5%
, 161591
 
2.6%
1 98607
 
1.6%
2 88727
 
1.4%
4 86077
 
1.4%
9 81470
 
1.3%
3 79226
 
1.3%
Other values (4) 267571
 
4.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 6203983
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 2643222
42.6%
$ 899164
 
14.5%
. 899164
 
14.5%
899164
 
14.5%
, 161591
 
2.6%
1 98607
 
1.6%
2 88727
 
1.4%
4 86077
 
1.4%
9 81470
 
1.3%
3 79226
 
1.3%
Other values (4) 267571
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 6203983
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 2643222
42.6%
$ 899164
 
14.5%
. 899164
 
14.5%
899164
 
14.5%
, 161591
 
2.6%
1 98607
 
1.6%
2 88727
 
1.4%
4 86077
 
1.4%
9 81470
 
1.3%
3 79226
 
1.3%
Other values (4) 267571
 
4.3%

GrAppv
Text

Distinct22128
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:34.669002image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.513319
Min length8

Characters and Unicode

Total characters10352362
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13651 ?
Unique (%)1.5%

Sample

1st row$60,000.00
2nd row$40,000.00
3rd row$287,000.00
4th row$35,000.00
5th row$229,000.00
ValueCountFrequency (%)
50,000.00 69394
 
7.7%
25,000.00 51258
 
5.7%
100,000.00 50977
 
5.7%
10,000.00 38366
 
4.3%
150,000.00 27624
 
3.1%
20,000.00 23434
 
2.6%
35,000.00 23181
 
2.6%
30,000.00 21004
 
2.3%
5,000.00 19146
 
2.1%
15,000.00 18472
 
2.1%
Other values (22118) 556308
61.9%
2025-02-09T09:23:34.921694image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4946152
47.8%
, 925342
 
8.9%
. 899164
 
8.7%
$ 899164
 
8.7%
899164
 
8.7%
5 450225
 
4.3%
1 345271
 
3.3%
2 266534
 
2.6%
3 180629
 
1.7%
4 133995
 
1.3%
Other values (4) 406722
 
3.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 10352362
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 4946152
47.8%
, 925342
 
8.9%
. 899164
 
8.7%
$ 899164
 
8.7%
899164
 
8.7%
5 450225
 
4.3%
1 345271
 
3.3%
2 266534
 
2.6%
3 180629
 
1.7%
4 133995
 
1.3%
Other values (4) 406722
 
3.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 10352362
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 4946152
47.8%
, 925342
 
8.9%
. 899164
 
8.7%
$ 899164
 
8.7%
899164
 
8.7%
5 450225
 
4.3%
1 345271
 
3.3%
2 266534
 
2.6%
3 180629
 
1.7%
4 133995
 
1.3%
Other values (4) 406722
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 10352362
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 4946152
47.8%
, 925342
 
8.9%
. 899164
 
8.7%
$ 899164
 
8.7%
899164
 
8.7%
5 450225
 
4.3%
1 345271
 
3.3%
2 266534
 
2.6%
3 180629
 
1.7%
4 133995
 
1.3%
Other values (4) 406722
 
3.9%
Distinct38326
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size6.9 MiB
2025-02-09T09:23:35.136076image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length14
Median length11
Mean length11.308074
Min length8

Characters and Unicode

Total characters10167813
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23260 ?
Unique (%)2.6%

Sample

1st row$48,000.00
2nd row$32,000.00
3rd row$215,250.00
4th row$28,000.00
5th row$229,000.00
ValueCountFrequency (%)
25,000.00 49579
 
5.5%
12,500.00 40147
 
4.5%
5,000.00 31135
 
3.5%
50,000.00 25047
 
2.8%
10,000.00 17009
 
1.9%
17,500.00 16141
 
1.8%
15,000.00 14490
 
1.6%
7,500.00 12781
 
1.4%
127,500.00 11946
 
1.3%
80,000.00 10965
 
1.2%
Other values (38316) 669924
74.5%
2025-02-09T09:23:35.549215image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4048030
39.8%
, 908994
 
8.9%
. 899164
 
8.8%
$ 899164
 
8.8%
899164
 
8.8%
5 654346
 
6.4%
2 433556
 
4.3%
1 386969
 
3.8%
7 251493
 
2.5%
3 186643
 
1.8%
Other values (4) 600290
 
5.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 10167813
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 4048030
39.8%
, 908994
 
8.9%
. 899164
 
8.8%
$ 899164
 
8.8%
899164
 
8.8%
5 654346
 
6.4%
2 433556
 
4.3%
1 386969
 
3.8%
7 251493
 
2.5%
3 186643
 
1.8%
Other values (4) 600290
 
5.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 10167813
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 4048030
39.8%
, 908994
 
8.9%
. 899164
 
8.8%
$ 899164
 
8.8%
899164
 
8.8%
5 654346
 
6.4%
2 433556
 
4.3%
1 386969
 
3.8%
7 251493
 
2.5%
3 186643
 
1.8%
Other values (4) 600290
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 10167813
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 4048030
39.8%
, 908994
 
8.9%
. 899164
 
8.8%
$ 899164
 
8.8%
899164
 
8.8%
5 654346
 
6.4%
2 433556
 
4.3%
1 386969
 
3.8%
7 251493
 
2.5%
3 186643
 
1.8%
Other values (4) 600290
 
5.9%

Interactions

2025-02-09T09:23:21.322984image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:13.814784image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:14.884291image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:16.002075image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:17.088454image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:18.194032image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:19.247772image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:20.306407image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:21.454383image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:13.951609image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:15.021013image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:16.139868image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:17.224285image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:18.325335image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:19.381479image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:20.431773image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:21.591285image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:14.086245image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:15.163008image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:16.280079image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:17.358288image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:18.456715image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:19.507095image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:20.560195image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:21.722728image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:14.215845image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:15.300880image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:16.412861image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:17.492531image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:18.586655image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:19.652867image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:20.685549image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:21.856314image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:14.351017image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:15.438002image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:16.546225image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:17.626491image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:18.712245image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:19.789122image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:20.814918image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:21.991643image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:14.482572image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:15.579249image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:16.678826image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:17.764580image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:18.844477image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:19.914124image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:20.943663image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:22.120509image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:14.611856image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:15.715488image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:16.811442image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:17.919282image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:18.982930image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:20.042437image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:21.067817image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:22.261345image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:14.739752image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:15.858167image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:16.946864image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:18.056033image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:19.112746image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:20.175290image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-02-09T09:23:21.192757image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Correlations

2025-02-09T09:23:35.624169image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
BalanceGrossCreateJobFranchiseCodeLoanNr_ChkDgtLowDocMIS_StatusNAICSNewExistNoEmpRetainedJobRevLineCrTermUrbanRuralZip
BalanceGross1.0000.0000.0050.0010.0000.0000.0010.0000.0000.0000.0000.0000.0020.001
CreateJob0.0001.000-0.054-0.0310.0030.0120.1570.0090.0340.3770.0110.0820.0250.026
FranchiseCode0.005-0.0541.0000.3920.0140.022-0.0910.0990.121-0.2630.0440.1960.0130.031
LoanNr_ChkDgt0.001-0.0310.3921.0000.1100.237-0.0500.0620.075-0.1420.0840.1210.1890.031
LowDoc0.0000.0030.0140.1101.0000.0880.0600.1160.0000.0030.0870.0680.1570.059
MIS_Status0.0000.0120.0220.2370.0881.0000.1480.0220.0040.0130.1460.4920.2110.081
NAICS0.0010.157-0.091-0.0500.0600.1481.0000.094-0.1540.2710.124-0.0810.432-0.034
NewExist0.0000.0090.0990.0620.1160.0220.0941.0000.0050.0020.0650.0880.0300.088
NoEmp0.0000.0340.1210.0750.0000.004-0.1540.0051.0000.1240.0000.2000.0100.059
RetainedJob0.0000.377-0.263-0.1420.0030.0130.2710.0020.1241.0000.010-0.1570.025-0.026
RevLineCr0.0000.0110.0440.0840.0870.1460.1240.0650.0000.0101.0000.1400.3480.056
Term0.0000.0820.1960.1210.0680.492-0.0810.0880.200-0.1570.1401.0000.2070.142
UrbanRural0.0020.0250.0130.1890.1570.2110.4320.0300.0100.0250.3480.2071.0000.126
Zip0.0010.0260.0310.0310.0590.081-0.0340.0880.059-0.0260.0560.1420.1261.000

Missing values

2025-02-09T09:23:22.844803image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
A simple visualization of nullity by column.
2025-02-09T09:23:24.254314image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2025-02-09T09:23:27.184809image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

LoanNr_ChkDgtNameCityStateZipBankBankStateNAICSApprovalDateApprovalFYTermNoEmpNewExistCreateJobRetainedJobFranchiseCodeUrbanRuralRevLineCrLowDocChgOffDateDisbursementDateDisbursementGrossBalanceGrossMIS_StatusChgOffPrinGrGrAppvSBA_Appv
01000014003ABC HOBBYCRAFTEVANSVILLEIN47711FIFTH THIRD BANKOH45112028-Feb-9719978442.00010NYNaN28-Feb-99$60,000.00$0.00P I F$0.00$60,000.00$48,000.00
11000024006LANDMARK BAR & GRILLE (THE)NEW PARISIN465261ST SOURCE BANKIN72241028-Feb-9719976022.00010NYNaN31-May-97$40,000.00$0.00P I F$0.00$40,000.00$32,000.00
21000034009WHITLOCK DDS, TODD M.BLOOMINGTONIN47401GRANT COUNTY STATE BANKIN62121028-Feb-97199718071.00010NNNaN31-Dec-97$287,000.00$0.00P I F$0.00$287,000.00$215,250.00
31000044001BIG BUCKS PAWN & JEWELRY, LLCBROKEN ARROWOK740121ST NATL BK & TR CO OF BROKENOK028-Feb-9719976021.00010NYNaN30-Jun-97$35,000.00$0.00P I F$0.00$35,000.00$28,000.00
41000054004ANASTASIA CONFECTIONS, INC.ORLANDOFL32801FLORIDA BUS. DEVEL CORPFL028-Feb-971997240141.07710NNNaN14-May-97$229,000.00$0.00P I F$0.00$229,000.00$229,000.00
51000084002B&T SCREW MACHINE COMPANY, INCPLAINVILLECT6062TD BANK, NATIONAL ASSOCIATIONDE33272128-Feb-971997120191.00010NNNaN30-Jun-97$517,000.00$0.00P I F$0.00$517,000.00$387,750.00
61000093009MIDDLE ATLANTIC SPORTS CO INCUNIONNJ7083WELLS FARGO BANK NATL ASSOCSD02-Jun-80198045452.00000NN24-Jun-9122-Jul-80$600,000.00$0.00CHGOFF$208,959.00$600,000.00$499,998.00
71000094005WEAVER PRODUCTSSUMMERFIELDFL34491REGIONS BANKAL81111828-Feb-9719978412.00010NYNaN30-Jun-98$45,000.00$0.00P I F$0.00$45,000.00$36,000.00
81000104006TURTLE BEACH INNPORT SAINT JOEFL32456CENTENNIAL BANKFL72131028-Feb-97199729722.00010NNNaN31-Jul-97$305,000.00$0.00P I F$0.00$305,000.00$228,750.00
91000124001INTEXT BUILDING SYS LLCGLASTONBURYCT6073WEBSTER BANK NATL ASSOCCT028-Feb-9719978432.00010NYNaN30-Apr-97$70,000.00$0.00P I F$0.00$70,000.00$56,000.00
LoanNr_ChkDgtNameCityStateZipBankBankStateNAICSApprovalDateApprovalFYTermNoEmpNewExistCreateJobRetainedJobFranchiseCodeUrbanRuralRevLineCrLowDocChgOffDateDisbursementDateDisbursementGrossBalanceGrossMIS_StatusChgOffPrinGrGrAppvSBA_Appv
8991549995423005LITWIN LIVERY SERVICES, INC.CAMPBELLOH44405JPMORGAN CHASE BANK NATL ASSOCIL027-Feb-9719976011.000100NNaN30-Sep-97$10,000.00$0.00P I F$0.00$10,000.00$5,000.00
8991559995453003FUTURE LEADERS CENTER, INC.SO. OZONE PARKNY11420FLUSHING BANKNY62441027-Feb-97199718021.000100NNaN30-Jun-97$123,000.00$0.00P I F$0.00$128,000.00$96,000.00
8991569995473009FABRICATORS STEEL, INC.BALTIMOREMD21224BANK OF AMERICA NATL ASSOCMD33243127-Feb-97199760201.000100NNaN30-Jun-97$50,000.00$0.00P I F$0.00$50,000.00$25,000.00
8991579995493004PULLTARPS MFG.EL CAJONCA92020U.S. BANK NATIONAL ASSOCIATIONCA31491227-Feb-97199736401.00010NNNaN31-Mar-97$200,000.00$0.00P I F$0.00$200,000.00$150,000.00
8991589995563001SHADES WINDOW TINTING AUTO ALAIRVINGTX75062LOANS FROM OLD CLOSED LENDERSDC027-Feb-9719978452.00010NYNaN30-Jun-97$79,000.00$0.00P I F$0.00$79,000.00$63,200.00
8991599995573004FABRIC FARMSUPPER ARLINGTONOH43221JPMORGAN CHASE BANK NATL ASSOCIL45112027-Feb-9719976061.000100NNaN30-Sep-97$70,000.00$0.00P I F$0.00$70,000.00$56,000.00
8991609995603000FABRIC FARMSCOLUMBUSOH43221JPMORGAN CHASE BANK NATL ASSOCIL45113027-Feb-9719976061.00010YNNaN31-Oct-97$85,000.00$0.00P I F$0.00$85,000.00$42,500.00
8991619995613003RADCO MANUFACTURING CO.,INC.SANTA MARIACA93455RABOBANK, NATIONAL ASSOCIATIONCA33232127-Feb-971997108261.00010NNNaN30-Sep-97$300,000.00$0.00P I F$0.00$300,000.00$225,000.00
8991629995973006MARUTAMA HAWAII, INC.HONOLULUHI96830BANK OF HAWAIIHI027-Feb-9719976061.00010NY8-Mar-0031-Mar-97$75,000.00$0.00CHGOFF$46,383.00$75,000.00$60,000.00
8991639996003010PACIFIC TRADEWINDS FAN & LIGHTKAILUAHI96734CENTRAL PACIFIC BANKHI027-Feb-9719974812.00010NNNaN31-May-97$30,000.00$0.00P I F$0.00$30,000.00$24,000.00